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METHOD, APPARATUS AND ARTICLE OF MANUFACTURE 
FOR MANAGING A REUSABLE LINEAR ACCESS STORAGE MEDIUM 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] The invention relates to computers and computer memory. More particularly, 
the invention relates to a method, apparatus and article of manufacture for managing a 
reusable linear access storage medium. 

Background of the Related Art 

[0002] All computer systems store data on some type of storage medium having a 
finite storage capacity. The storage medium may be configured as a plurality of media 
blocks that are reusable. To facilitate the storage of data on such a limited capacity 
storage medium, an operating system assigns an "expiration date" to data blocks 
stored on the storage medium. A data block may comprise one data file, multiple data 
files or a portion of one data file. The use of expiration dates provides a measure of 
how long a particular data block is needed on the storage medium. The data block is 
considered active prior to the expiration date, but is considered expired after the 
expiration date. 

[0003] The storage medium is classified as a random access medium or a linear (or 
sequential) access medium depending on how data blocks are accessed on the 
storage medium by a storage access device. In a random access medium, data block 
is directly accessible without the need to access any other data block. The random 
access medium, e.g., DVD-RAM (digital video disk random access memory), is 
generally limited to fixed size media blocks to enable data access. In contrast, a linear 
or sequential access medium may require access to other data blocks before an 
access of a particular data block. Linear access medium (or linear medium), e.g., 
magnetic tapes, may have variable size media blocks. 
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[0004] For linear access mediums, the operating system may track or monitor the 
different media blocks containing active data blocks within a media set, i.e., a plurality 
of media blocks, by using a "media header label" for each media block. The media 
header label contains information on the first media block of the media set and a media 
block sequence (MBSQ) number of the particular media block. The media block 
sequence number is indicative of a position of the media block in the media set. For 
example, the first media block used to store data blocks would have the lowest MBSQ 
number, while the last media block used to store data blocks would have the highest 
MBSQ number in the media set. Moreover, the operating system also assigns data 
block sequence numbers for data blocks, stored in each media set. These data block 
sequence numbers are indicative of the order in storing the data blocks in the media 
set. 

[0005] Over time, some of these data blocks in the media set will become expired, 
i.e., the expiration date has passed for these data blocks. Although such expired data 
blocks are no longer used used by the computer system, the data blocks still occupy 
media blocks in the storage medium. As such, the operating system needs to remove 
the expired files such that these media blocks are reusable. 

[0006] However, removing expired data blocks from the media set causes problems 
in tracking data blocks in the media set. For example, when a media block is initially 
used or reused for storage of data blocks, a media header label is appended to the 
media block. The media header label contains a reference to the "first media block" in 
the media set used to store data blocks. If this first media block no longer contains 
active data, the first media block is identified for removal from the media set. After 
removal of this first media block from the media set, the media header label for the 
remaining media blocks would reference a media block, i.e., the first media block, that 
is no longer part of the media set. As such, the media header label is no longer 
correctly referencing the first media block in the updated media set. 
[0007] There are currently several possible approaches to resolve the problems 
associated with handling the media set in response to expired data blocks. One 
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approach requires a user to manually remove media blocks that no longer contain any 
active data blocks, i.e., media blocks that only contain expired data blocks, from the 
media set. The removed media blocks are placed in a scratch pool of media blocks 
available for reuse or future storage. Although this approach allocates media blocks 
are for immediate reuse, a problem exists when a media block removed as the first 
media block of one media set is then reused as the first media block of another media 
set. This causes media blocks from two different media sets to reference the first 
media block, and corresponding confusion for the operating system. Moreover, if the 
first media block is removed and subsequently reused in the first media set, the media 
block would still be erroneously referenced, even though it no longer the oldest media 
block used to store the oldest data blocks. 

[0008] A second approach stores new data blocks into a new media set. Media 
blocks in an existing media set are not affected or reused until all data blocks in that 
media set are expired. As such, this approach does not allocate media blocks for 
immediate reuse when that media block contains only expired data blocks. Moreover, 
if a last or last-added media block in the media set has not fully used its entire memory 
for storage, that unused memory is wasted. Thus, the approach fails to efficiently use 
all media blocks of a media set. 

[0009] Thus, there is a need to provide accurate and efficient management of media 
blocks and associated media sets in a reusable linear access storage medium. 

SUMMARY OF THE INVENTION 

[0010] The invention provides a method, apparatus and article of manufacture for 
managing a reusable storage medium having a plurality of media blocks. A particular 
media block is included in a media set if the media block stores at least a portion of 
one linearly ordered data block. In one embodiment, an active window table is created 
to monitor the media blocks in the media set. If all data blocks stored in a particular 
media block have expired, then the media block is identified for further processing. 
The identified media blocks are removed from the active window table. A 
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determination is made as to whether any identified media block is a reference media 
block that is an oldest media block in the media set. If the identified media block is the 
reference media block, the active window table is updated to indicate the reference 
media block as the oldest media block of the remaining media blocks in the active 
window table. 

[0011] An apparatus comprising a memory and a processor is also provided. The 
apparatus is used with a storage medium comprising a plurality of media blocks, where 
a particular media block is included in a media set if the media block is used to store at 
least a portion of a data block. The memory stores a data backup program used with 
an operating system. The processor performs a method upon executing the data 
backup program and the operating system. Initially, an active window table is created 
to monitor the media blocks in the media set. If all data blocks stored in a particular 
media block have expired, then the media block is identified for further processing. 
The identified media blocks are removed from the active window table. A 
determination is made as to whether any identified media block is a reference media 
block that is an oldest media block in the media set. If the identified media block is the 
reference media block, the active window table is updated to indicate the reference 
media block as the oldest media block of the remaining media blocks in the active 
window table. 

[0012] Additionally, a computer readable medium storing a software program is 
provided. The software program, when executed by a computer, causes the computer 
to perform a method. Initially, an active window table is created to monitor the media 
blocks in the media set. If all data blocks stored in a particular media block have 
expired, then the media block is identified for further processing. The identified media 
blocks are removed from the active window table. A determination is made as to 
whether any identified media block is a reference media block that is an oldest media 
block in the media set. If the identified media block is the reference media block, the 
active window table is updated to indicate the reference media block as the oldest 
media block of the remaining media blocks in the active window table. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0013] The teachings of the present invention can be readily understood by 
considering the following detailed description in conjunction with the accompanying 
drawings, in which: 

[0014] FIG. 1 depicts a block diagram of a computer system utilized to implement the 
present invention; 

[0015] FIG. 2A illustrates the Storage of data blocks in a media set having a plurality 
of media blocks in the storage medium; 

[0016] FIG. 2B illustrates the media set of FIG. 2A after expiration of some of data 
blocks; 

[0017] FIG. 2C illustrates the media set of FIG. 2B after storing additional data 
blocks; 

[0018] FIG. 2D illustrates the media set of FIG. 2C after reusing available media 
blocks on a scratch pool list; 

[0019] FIG. 3A depicts an active window table used for tracking the storage medium 
of FIG. 2A; 

[0020] FIG. 3B depicts the active window table for the media set of FIG. 2B; 
[0021] FIG. 3C depicts the active window table for the media set of FIG. 2C; 
[0022] FIG. 3D depicts the active window table for the media set of FIG. 2D; and 
[0023] FIG. 4 depicts a flow chart of a method to implement the data backup 
program; 

[0024] FIG. 5 depicts a flow chart of a method to allocate media blocks for storing 
data blocks; and 

[0025] FIG. 6 depicts a flow chart of a method to update an active window table. 
[0026] To facilitate understanding, identical reference numerals have been used, 
where possible, to designate identical elements that are common to the figures. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
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[0027] FIG. 1 depicts an illustrative computer system 100 utilized in accordance with 
the present invention. The computer system 100 may represent any type of computer, 
computer system or other programmable electronic device, including a client computer, 
a server computer, a portable computer, an embedded controller, and the like. The 
computer system 100 may be a standalone device or coupled to a computer network 
system. In one embodiment, the computer system 100 is an AS/400 available from 
International Business Machines of Armonk, New York. 

[0028] The computer system 100 is shown in a multi-user programming environment 
having at least one processor 102, which obtains instructions and data from a main 
memory 106 via a bus 104. The main memory 106 includes an operating system 108 
and at least one computer program, e.g., a data backup program 110. In addition, the 
main memory 106 may contain various data structures including an active window 112 
and a scratch pool 113. The main memory 106 may comprise one or a combination of 
memory devices, including Random Access Memory, nonvolatile or backup memory, 
(e.g., programmable or Flash memories, read-only memories, and the like). In 
addition, memory 106 may include memory physically located elsewhere in a computer 
system 100, for example, any storage capacity used as virtual memory or stored on a 
mass storage device or on another computer coupled to the computer system 100 via 
bus 104. 

[0029] The computer system 1 00 is generally coupled to a number of peripheral 
devices. In one embodiment, the computer system 100 is illustratively coupled to a 
storage medium 116, input devices 120, output devices 126, and a plurality of 
networked devices 124. Each of these peripheral devices is operably coupled to the 
computer system 100 via respective interfaces. A storage interface 114 couples the 
computer system 1 00 to the storage medium 1 1 6. One example of the mass storage 
interface 1 14 is a disk drive. An input/output interface 118 couples the computer 
system 100 to the input devices 120 and output devices 126. Additionally, a network 
interface 122, e.g., a network interface card, couples the computer system 100 to the 
plurality of networked devices 1 24. 
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[0030] The storage medium 1 16 may comprise either a permanent or removable 
direct access storage device (DASD) of finite storage capacity. The storage medium 
1 1 6 may be a nonvolatile or backup memory. In one embodiment, the storage 
medium 1 16 is configured as a plurality of media blocks that are either used, e.g., 
currently storing data, or unused. Each media block, also known as a "volume," is 
used to store data blocks or blocks of information associated with data files. The 
storage medium 1 16 in the present invention is a linear access medium. In a linear 
access medium, the storage interface 1 14, e.g., a disk drive or tape drive, may access 
other data blocks prior to accessing a particular data block. One example of the linear 
access medium 1 16 is a magnetic optical (MO) medium. 

[0031] To maximize the utility of the finite storage capacity of the storage medium 
116, the media blocks are reusable. Expiration dates are assigned to the data blocks 
by the operating system 108. The use of expiration dates identifies how long a data 
block is needed on the storage medium 116. As such, if a particular media block is 
only storing expired data blocks, that memory block may be reused to store additional 
data blocks. The storage of data blocks and reuse of media blocks is further described 
with respect to FIGS. 2A-2D. 

[0032] The operating system 108 is the software utilized to operate the computer 
system 100. More specifically, the operating system 108 performs a variety of 
functions including coordinating the storage of data blocks in the storage medium 116 
and using expiration dates tracking the stored data blocks. The operating system 108 
uses the expiration dates with the active window 1 12 to coordinate the reuse of media 
blocks in the storage medium 116. One exemplary operating system 108 is the IBM 
AS/400. However, other operating systems 108 capable of using expiration dates to 
track data blocks are also applicable. Such operating systems 108 may include UNIX, 
IBM AIX, Microsoft Windows, and the like. 

[0033] The data storage program 1 10 is a computer program that enables the 
operating system 108 to optimize the storage of data blocks in the storage medium 
116. One embodiment of the data storage program 1 1 0 is a particular feature or option 
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of the IBM BRMS/400 (BRMS: Backup Recovery Media Services). To optimize the 
storage of data blocks, the data storage program 110 uses an active window table 112 
and a scratch pool list 113. In one embodiment, the program 110 maintains active 
window table 112 and the operating system 108 maintains the scratch pool list 113. 
The active window table 1 12 is a data structure for identifying data blocks that are 
currently active, i.e., not expired, and media blocks used to Store active or unexpired 
data blocks. Operation of the active window table 1 12 is further described with respect 
to FIGS. 3A-3D. The scratch pool list 1 13 is a data structure for identifying media 
blocks that are available for storing data blocks. Such media blocks may include 
unused media blocks or previously used media blocks that contain expired data blocks. 
[0034] The input devices 120 may comprise any device utilized to provide input to the 
computer system 100. Examples of input devices 120 include a keyboard, a keypad, a 
light pen, a touch screen, a button, a mouse, a track ball, a speech recognition unit, 
and the like. The output devices 126 may comprise any conventional display screen. 
Although shown separately from the input devices 120, the output devices 126 and 
input devices 1 20 could be combined. For example, a display screen with an 
integrated touch screen, and a display with an integrated keyboard, or a speech 
recognition unit combined with a text speech converter could be used. 
[0035] In general, the routines executed to implement the embodiments of the 
invention, whether implemented as part of an operating system or a specific 
application, component, program, object, module or sequence of instructions will be 
referred to herein as the data backup program 1 10, or simply as the program 1 1 0. The 
program 110 typically comprises one or more instructions that are resident at various 
times in various memory and storage devices in the computer system 100. When read 
and executed by one or more processors 102 in the computer system 100, the 
program 110 causes that computer system 100 to perform the steps necessary to 
execute steps or elements embodying the various aspects of the invention. Moreover, 
while the invention has and hereinafter will be described in the context of fully 
functioning computers and computer systems, those skilled in the art will appreciate 



-9- 



ROC920000203US1 

Express Mai! No. EL684621135US 

that the various embodiments of the invention are capable of being distributed as a 
program product in a variety of forms, and that the invention applies equally regardless 
of the particular type of signal bearing or computer readable media used to actually 
carry out the distribution. Examples of signal bearing or computer readable media 
include, but are not limited to, recordable type media such as volatile and nonvolatile 
memory devices, floppy and other removable disks, hard disk drives, optical disks 
(e.g., CD-ROM, DVD, and the like), among others. Examples of transmission type 
media such as digital and analog communication links. 

[0036] In addition, various programs and devices described hereinafter may be 
identified based upon the application for which they are implemented in a specific 
embodiment of the invention. However, it should be appreciated that any particular 
program or device nomenclature that follows is used merely for convenience, and the 
invention is not limited to use solely in any specific application identified and/or implied 
by such nomenclature. 

[0037] FIG. 2A illustrates a plurality of media blocks 202^ 202 2 , 202 n (generally 
referred as 202 o ) used to store a plurality of data blocks 204 1( 204 2 , 204 m (generally 
referred as 204 m ). The number of media blocks 202 n (also known as volumes or 
storage blocks) is a finite number for a storage medium 1 1 6 of limited capacity. 
Illustratively, nine media blocks 202i, 202 2 , 202 9 are initially used to store thirteen 
linearly ordered data blocks 204i, 204 2 , 204 13 . In one embodiment, the media 
blocks 202 n are assigned a media blocks sequence (MBSQ) identifier by the operating 
system 108. The media blocks 202 n are ordered in the order used to store data blocks 
204 m . For example, MBSQ 1 is the identifier for the first media block 202i, MBSQ 2 is 
the identifier for the second media block 202 2 and MBSQ n is the identifier for the n-th 
media block 202 n . 

[0038] For a linear access medium, an access of a particular data block 204 m 
requires an access to previously stored data blocks 204 m . As such, older data blocks 
204 m cannot be removed from the media blocks 202n without invalidating the access to 
later stored data blocks 204 m . A data structure used to track the media blocks 202 n 
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and data blocks 204 m is herein referred as an active window table 112. The active 
window table 1 12 is further described with reference to FIGS. 3A-3D. All unused data 
blocks are monitored in a data structure, e.g., a scratch pool list 1 13, by the data 
backup program 110. 

[0039] The size of the media blocks 202 n and data blocks 204 m may be variable. One 
media block 202 n may store multiple data blocks. For example, media block A 202i 
stores data block 1 204i, data block 2 204 2 and a portion of data block 3 204 3 . Also, 
multiple media blocks 202 n may be required to store one data block 204 m . For 
example, data block 5 204 5 is stored in or is spanned over at least portions of five 
media blocks D, E, F, G and H, 202 4 to 202 8 . 

[0040] To manage the use of the media blocks 202 n , the operating system 1 08 
assigns an expiration date for each data block 204 m . The expiration date indicates the 
length of time a data block 204 m is active or usable in the storage medium 116. If an 
expiration date for the data block 204 m has not passed, the data block 204 m remains 
active, e.g., usable by the computer system 100. However, once the expiration data 
for the data block 204 m has passed, the data block is no longer active, i.e., expired. 
Such an expired data block is usable but has decreased relevance, e.g., stale or 
untimely information, in the computer system 100. If a media block 202 n contains only 
expired data blocks 204 m , then the operating system 108 would allocate that particular 
media block 202 n for reuse. 

[0041] A collection of linearly ordered media blocks 202 n is herein referred to as a 
"media set" 200. If al! the media blocks 202 n in the media set 200 stores at least a 
portion of an active data block 204 m , the media set 200 is a " dynamic media set" 200. 
The dynamic media set 200 changes if additional media blocks 202 n are needed or if 
media blocks 202 n in the media set 200 may be reused. For example, additional media 
blocks 202 n are required to store additional data blocks 204m. Additionally, media 
blocks 202 n need to be reused if all the data blocks 204 m in the media set 200 become 
expired, i.e., after the expiration date of all the data blocks 204 m in a particular media 
block 202 n has passed. To track the media blocks 202 n in a particular media set 200, 
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the operating system 108 assigns a "media header label" 206 n for each media block 
202 n . The media header label 206 n contains an identifier of the first media block 202i 
in the media set 200 and the MBSQ number indicative of the position of a particular 
media block 202 n within the media set 200. The first media block 202i in the media set 
200 is also referred to as a "reference data block." In the example of FIG. 2, the media 
set 200 illustratively comprises nine media blocks 202i, 202 2 , 202 9) and first media 
block 202-i is media block A. 

[0042] FIG. 2B depicts the media set 200 after expiration of some data blocks 204 m . 
Illustratively, the first five data blocks 204i, 204 2 , 204 3 , 204 4 and 204 5 have now 
expired, i.e., the expiration dates have passed for these blocks 204i, 2042, 2043, 2044 

and 204 5 . These seven media blocks 202i, 202 2 202 7 now contain only expired 

data blocks 204 m . To maximize reuse of the storage medium, the operating system 
108 must remove these media blocks 202i, 202 2 , 202 7 from the media set 200. 
However, if media blocks 202i, 202 2 , 202 7 are removed from the media set 200 and 
allocated for reuse, the other media header label 206 8 and 206 9 information is no 
longer accurate for the updated media set 200 comprising media blocks 202 8 and 202 9 . 
Namely, the media header labels 206 8 and 206 9 no longer accurately describe the 
media set 200, since the media header labels 206 8 and 206 9 now point or reference a 
media block 202i that is no longer part of the media set 200. Moreover, alteration of 
the media header labels 206s and 206g is impossible without destroying everything 
else, e.g., data blocks 204 6 to 204i 3 , in the media blocks 202 8 and 202 9 . 
[0043] The use of incorrect media header labels 206 n may cause problems for the 
operating system 1 08 in tracking the media set 200. If the first media block 202 t of the 
original media set 200 is reused in the beginning of another media set 200, then the 
first media block 202i is referenced in two separate media sets 200. The operating 
system 108 would then have problems differentiating between various media blocks 
202 n in the two media sets 200. If the first media block 202i is reused in the original 
media set 200 to store additional data, then the reused media block 202i would be 
erroneously referenced as it is no longer the reference media block 202 n . 
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[0044] To eliminate the problems in reusing media blocks 202 n in the operating 
system 108, the active window 1 12 is used to manage the media blocks 202 n in the 
media set 200. The active window 1 12 for the example of FIG. 2B is further described 
with respect to FIG. 3B. 

[0045] FIG. 2C depicts the media set 200 after storing an additional data block 204i 4 . 
To maximize the use of storage medium 116, any available storage memory in the last 
media block 2029 is used to store any additional data blocks 204i 4 . If additional 
storage memory is required to store additional media blocks 202 n , then additional 
media blocks 202 n are retrieved from available storage memory identified in the scratch 
pool 113. 

[0046] FIG. 2D depicts the media set 200 having additional media blocks 202 n to 
store additional data blocks 204 m . Illustratively, additional memory is required to store 
an additional data block 204i 5 . The additional memory to store the new data block 
204-I5 is retrieved from available storage memory identified in the scratch pool list 113. 
Illustratively, previously used media blocks 202 2 , 202 3 , 202 4 and 202 5 are retrieved 
from the scratch pool 113 and reused to store the new data block 204i 5 - 
[0047] FIG. 3A depicts the active window table 112 used for tracking the media set 
200 of FIG. 2A. The active window table 1 12 is a data structure that maintains 
information on the dynamic media set 200. As described above, the dynamic media 
set 200 is a set of media blocks 202 n that stores currently active data blocks 204 m . 
Thus, the dynamic media set 200 changes if data blocks 204 m become expired or if 
additional data blocks 204 m are stored in a particular media set 200. In one 
embodiment, each entry or row in the active window table 112 represents a unique 
data block 204 m and media block 202 n combination. For example, media block A 202i 
stores two data blocks 204i and 204 2 and a portion of a third data block 204 3 . The 
active window table 112 illustratively contains separate entries for each different data 
block 204 m in the media block 202i. 

[0048] The active window table 112 contains a plurality of fields that contain 
information for each entry or row. In one embodiment, the active window table 112 
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comprises a data block sequence identifier 302, a media block identifier 304, a first 
media block (BegMB) identifier 306, a media block sequence (MBSQ) identifier 308, 
total media blocks identifier 310 and an expiration date identifier 312. 
[0049] The data block sequence identifier 302 comprises a position of a particular 
data block 204 m in the linearly ordered sequence of data blocks 204 m in the dynamic 
media set 200. The media block identifier 304 comprises an identifier or name of a 
particular media block 202 n . The first media block identifier 306 comprises the 
identifier of the first or oldest media block 202 n or reference media block currently in 
the media set 200. In contrast to the media header label 206 n , the first media block 
identifier 306 contains the identifier of the media block 202 n that is the first in the media 
set 200, i.e., the reference media block. As such, the use of the active window table 
112 instead of the media header label 206 n enables the data backup program 110 to 
properly track the reference media block and manage the reuse of media blocks 202 n 
in the storage medium 1 1 6. 

[0050] The MBSQ identifier 308 comprises a position of a particular media block 202 n 
in the linearly ordered sequence of media blocks 202 n . The total media blocks 
identifier 312 comprises the total number of media blocks 202 n in a dynamic media set 
200. The value of the identifier 312 changes as media blocks 202 n are added and 
removed from the media set 200. The expiration date identifier 312 comprises the 
expiration date of a particular data block 204 m . The identifier 312 is used to update the 
media set 200. More specifically, if all the data blocks 204 m within a particular media 
block 202 n has expired, then the media block 202 n is removed from the media set 200 
and the active window table 112. 

[0051] FIG. 3B depicts the active window table 1 1 2 for the media set 200 of FIG. 2B. 
The data backup program 110 updates the active window table 1 1 2 to reflect the 
current status of the dynamic media set 200. In the example of FIG. 2B, the first five 
data blocks 202i, 202 2) ..., 202 5 have expired. This occurs after the expiration dates of 
these data blocks 202i, 202 2 , 202 5 have passed, e.g., on March 14. If the 
expiration date identifier 314 is passed, then the corresponding row or entry is removed 
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from active window 112. The remaining entries in the active window table 112 are 
updated to indicate that the first seven media blocks 202i, 202? are no longer part 
of the media set 200. If a particular media blocks 202 n contains only expired data 
blocks, the media block 202 n is placed in the scratch pool list 113 for future reuse. 
[0052] In contrast to the media header labels 206 8 and 206 9 , the first media block 
identifier 308 is updated to contain the media block identifier 306 of the oldest 
remaining media block 202 8 or reference data block in the media set 200. In one 
embodiment, all media blocks 202 n in the media set 200 have the same value for the 
BegMB identifier 306. The data backup program 110 uses information in the first 
media block identifier 306 to manage the allocation and reuse of media blocks 202 n of 
the storage medium 116. In addition to the first media block identifier 306 field, the 
data block sequence identifier 302, the media block sequence identifier 308, and the 
total media block identifier 310 are also updated. 

[0053] FIG. 3C depicts the active window table 1 1 2 for the media set 200 of FIG. 2C. 
Specifically, the data backup program 110 updates the active window table 1 12 to 
contain information for the most recently added data block 204i 4 . Additionally, the 
active window table 1 12 is updated in response to the expiration of other data blocks 
204 6 , 204 7 , 204 8 and 204 9 in media block H 202 8 , e.g., the date is now April, 20. Note 
that the media set 200 still contains an active data block 204 10 . Although the expired 
data blocks 204 6 , 204 7 , 204 8 and 204 9 are no longer part of the active window table 
112, these data blocks 204 m may remain in the media block 202 8 until the expiration of 
the last data block 204i 0 in the media block 202 8 . 

[0054] FIG. 3D depicts the active window table 1 1 2 for the media set 200 of FIG. 2D. 
To store the additional data block 204i 5 , the remaining storage memory in media block 
I 202 9 is used as well as additional media blocks 202 2 , 202 3 , 202 4 and 202 5 from the 
scratch pool 112. The updated active window 112 illustratively has five additional 
entries, i.e., one entry for each media block 202 n used to store the additional data block 
204 15 . 

[0055] FIG. 4 depicts a flow chart of a method 400 to implement the data backup 
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program 110. In one embodiment, the method 400 is implemented in the data backup 
program 11 0 in combination with the operating system 1 08. The method 400 updates 
an active window table 112 used to track the dynamic media set 200 comprising of 
media blocks 202 n containing active or non-expired data blocks 204 m . By tracking the 
dynamic media sets 200, the method 400 manages the allocation of storage memory in 
the reusable storage medium 1 1 6. 

[0056] In current implementations, the operating system 108 cannot modify 
information in the media header label 206 n without invalidating the entire media set 
200. The use of the active window table 1 12 to manage media sets 200 of reusable 
media blocks 202 n enables the program 110 and the operating system 108 to efficiently 
use the storage medium 116. Media blocks 202 n no longer containing active data 
blocks 204 m are marked for reuse. Otherwise, reuse of memory in an existing media 
set 200 is not possible until all data blocks 204 m in all media blocks 202 n of the media 
set 200 have expired. 

[0057] Specifically, the method 400 starts at step 402 and proceeds to step 404 
where a query determines whether any new media sets 200 have been created. In one 
embodiment, the media sets 200 in the storage medium 1 16 are created by the 
operating system 108. If new media sets 200 have been created, the method 400 
proceeds to create an active window table 1 12 for each new media set 200 and returns 
to step 404. If no new media sets 200 have been created, the method 400 proceeds to 
step 408, where a query determines whether there is a request to write to the storage 
medium 116. If there is no request to write to the storage medium 1 1 6, the method 
proceeds to step 41 6. 

[0058] If there is a request to write to the storage medium 1 1 6, the method 400 
proceeds to step 410 where media block(s) 202 n are allocated for each data block 
204 m to be stored. Step 410 is further described with respect to FIG. 5. After 
allocating media blocks 202 n at step 410, the method 400 proceeds to confirm storage 
of the data blocks 204 m at step 412, updates the active window table 1 12 at step 414 
and returns to step 404. Step 414 adds entries for the new data blocks 204 m . An 
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example of storing data blocks 204 m to an existing media set 200 was described with 
reference to FIG. 2C. An example of an active window table 112 tracking these 
changes was described with reference to FIG. 3C. 

[0059] At step 41 6, a query determines whether it is time to update the active window 
table 112. In one embodiment, such an update is performed periodically, e.g., every 
24 hours. If the active window table 1 12 is to be updated, the method 400 proceeds to 
update the active window table at step 418 and returns to step 404. Step 418 is further 
described with respect to FIG. 6. If the active window table 1 12 is not to be updated, 
the method 400 proceeds to step 420. At step 420, a query determines whether the 
program 110 Should continue. If the program 110 should continue, the method 400 
returns to step 404. If the program 110 should end, the method 400 exits at step 422. 
[0060] FIG. 5 depicts a flow chart of a method 500 to allocate media blocks 202 n for 
storing data blocks 204 m . The method 500 is embodied as step 410 described above 
with reference to FIG. 4. An example of allocating media blocks 202 n to store 
additional data blocks 204 m was described with reference to FIG. 2D. An example of 
an active window table 1 12 tracking these changes was described with reference to 
FIG. 3D. Specifically, the method 500 starts at step 502 and proceeds to step 504 to 
process each data block 204 m to be stored. The method 500 proceeds to step 506 
where each media block(s) 202 n used to store a particular data block 204 m is 
processed. In processing each media block 202 n , the method 500 proceeds to step 
508 where a query determines whether to store a particular data block 204 m in an 
existing media set 200. If the data block 204 m is not to be stored in an existing media 
set 200, the method 500 proceeds to select a media block 202 n from the scratch pool 
list 1 13 at step 510 and then to step 522. 

[0061] If the data block 204 m is to be stored in an existing media set 200, the method 
500 proceeds to step 51 2, where a query determines whether a limit of data blocks 
204 m is reached. The limit represents a maximum possible number of data blocks 
204 m in the media set 200. The value of this limit is also constrained by the operating 
system 108. For a 32-bit processor 102, an exemplary value is two to the thirty-second 
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power or approximately 16.7 million. If the limit is reached, the method 500 proceeds 
to select a media block 202 n from the scratch pool list 1 13 at step 51 4 and then 
proceeds to step 522. If the limit is not reached, the method 500 proceeds to step 516 
[0062] At Step 51 6, a query determines whether a limit of media blocks 202 n is in 
danger of being reached. The limit represents a maximum possible number of media 
blocks 204 m in the media set 200. The value of the limit is constrained by the 
operating system 108. For example, if the exemplary limit is 9999, then step 516 may 
anticipate the limit at 9000 media blocks 202 n . If the limit is in danger of being 
reached, the method 500 proceeds to select a media block 202 n from the scratch pool 
list 1 13 at step 518 and proceeds to step 522. If the limit is not in danger of being 
reached, the method 500 proceeds to step 520 where the last media block 202 n in the 
media set 200 is selected. After this step 520, the method 500 proceeds to step 522. 
[0063] At step 522, the method 500 notifies the selection of the media block 202n 
from steps 510, 514, 518 and 520 to the operating system 108. The operating system 
108 would then allocate the media block 202 n for storage of the particular media block 
202 n being considered. Once all the media blocks 202 n are processed for a particular 
data block 204 m at step 506, the method 500 proceeds back to step 504 where the 
next data block 204 m is processed. After processing all the data blocks 204 m to be 
stored at step 504, the method 500 ends at step 524. 

[0064] FIG. 6 depicts a flow chart of a method 600 to update an active window table 
112. The method 600 is embodied as step 418 described above with reference to FIG. 
4. The method 600 updates the active window table 1 12 to account for data blocks 
204 m that have expired. An example of an updated media set 200 to account for 
expired data blocks 204 m was described with reference to FIG. 2B. An example of an 
active window table 1 1 2 tracking these changes was described with reference to FIG. 
3B. Specifically, the method 600 starts at step 602 and proceeds to process each 
media set 200 at step 604. The method 600 then proceeds to step 606 where each 
media block 202 n in a particular media set 200 is processed. In one embodiment, step 
606 processes all media block sequence identifiers 306 in the active window table 112. 



-18- 



ROC920000203US1 

Express Mail No. EL684621135US 

[0065] At step 608, a query determines whether the media block 202 n contains only 
expired data blocks 204 m . Namely, step 608 determines whether to remove the media 
block 202 n from the active window table 1 12 to the scratch pool 113. If the media block 
202 n contains at least a portion of an active data block 204 m , i.e., does not only contain 
expired data blocks 204 m , the method 600 proceeds to process the next media block 
202 n at step 606. If the media block 202 n contains only expired data blocks 204 m , the 
method 600 proceeds to step 610 where entries in the active window table 112 are 
removed for the media block 202 n . The method 600 then proceeds to add the media 
block 202 n to the scratch pool list 1 13 at step 61 2 and returns to process the next 
media block 202 n at step 606. 

[0066] Once all the media blocks 202 n in the media set 200 are processed, the 
method 400 proceeds to step 614 where the active window table 1 12 is updated to 
remove entries for expired data blocks 204 m . The method 400 then proceeds to the 
next media set 200 at step 604. Once all the media sets 200 are processed, the 
method 600 proceeds to end at step 61 6. 

[0067] Although various embodiments which incorporate the teachings of the present 
invention have been shown and described in detail herein, those skilled in the art can 
readily devise many other varied embodiments that still incorporate these teachings. 
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